A General Framework for Image Kernel Engineering
نویسنده
چکیده
Understanding image content is a long standing problem of computer science. Despite decades of research in computer vision, an effective solution to this problem does not appear to be in sight. Recent advances in the theory of learning by examples indicate that devising systems which can be trained instead of programmed to solve this problem is an interesting alternative to solutions constructed from higher level image analysis and description. In this thesis we consider a number of image understanding problems viewed as classification problems for which a certain number of input/output pairs is given. Within the statistical learning schemes we adopt (binary support vector machines and one-class support vector machines), the solution to each problem is written as a linear combination of certain functions, named kernel functions. These functions, which satisfy some specific mathematical properties, are evaluated on input pairs and encode the prior knowledge on the problem domain. Roughly speaking, kernel functions can be thought of as measuring the similarity between input pairs by extracting certain features from the raw data. In this thesis we argue for the need of finding appropriate kernel functions for building effective trainable systems. Thus, we proceed to investigate, design, implement, and validate kernels for images, or engineer kernels, in this context. In the case of images, the problem of kernel engineering cannot be easily decoupled from the choice of the image description. Therefore, we make use of different descriptions depending on the nature of the considered classification problem. For classification problems like indoor/outdoor classification we choose to represent images through histograms (color, edges, co-occurrences, etc.), while for view-based object recognition we choose grey values and/or wavelets representations. In our work we introduce and study the mathematical properties of two image kernels: the Histogram Intersection kernel and the Hausdorff kernel. The Histogram Intersection kernel, derived from a similarity measure widely used in the computer vision community for color based object recognition, is found to be a very effective kernel for describing similarities between images in high level classification problems. The Histogram Intersection kernel can be implemented efficiently and does not require the introduction of additional parameters. The Hausdorff kernel, instead, which we derive as a specialization of a larger class of kernels defined on binary strings, appears to be well suited for measuring the similarity between image patches. We show that the Hausdorff kernel can be used to boost the performances of trainable 3D object detection systems trained on positive examples only. The obtained experimental results confirm that the choice of the appropriate kernel can make the difference for a specific application. We conclude by discussing strengths and weaknesses of the approach and outlining directions for future work.
منابع مشابه
A General Framework for 1-D Histogram-baesd Image Contrast Enhancement
In this paper, a general framework for image contrast enhancement algorithm based on an optimization problem is presented. Through this optimization, the intensities can be better distributed. The algorithm is based on the facts that the histogram of the enhanced image is close to the input image histogram and uniform distribution, simultaneously. Based on this fact, we obtain a closed form opt...
متن کاملUsing a Novel Concept of Potential Pixel Energy for Object Tracking
Abstract In this paper, we propose a new method for kernel based object tracking which tracks the complete non rigid object. Definition the union image blob and mapping it to a new representation which we named as potential pixels matrix are the main part of tracking algorithm. The union image blob is constructed by expanding the previous object region based on the histogram feature. The pote...
متن کاملFisher’s Linear Discriminant Analysis for Weather Data by reproducing kernel Hilbert spaces framework
Recently with science and technology development, data with functional nature are easy to collect. Hence, statistical analysis of such data is of great importance. Similar to multivariate analysis, linear combinations of random variables have a key role in functional analysis. The role of Theory of Reproducing Kernel Hilbert Spaces is very important in this content. In this paper we study a gen...
متن کاملISAR Image Improvement Using STFT Kernel Width Optimization Based On Minimum Entropy Criterion
Nowadays, Radar systems have many applications and radar imaging is one of the most important of these applications. Inverse Synthetic Aperture Radar (ISAR) is used to form an image from moving targets. Conventional methods use Fourier transform to retrieve Doppler information. However, because of maneuvering of the target, the Doppler spectrum becomes time-varying and the image is blurred. Joi...
متن کاملImproving Super-resolution Techniques via Employing Blurriness Information of the Image
Super-resolution (SR) is a technique that produces a high resolution (HR) image via employing a number of low resolution (LR) images from the same scene. One of the degradations that attenuates performance of the SR is the blurriness of the input LR images. In many previous works in the SR, the blurriness of the LR images is assumed to be due to the integral effect of the image sensor of the im...
متن کاملUtilizing Kernel Adaptive Filters for Speech Enhancement within the ALE Framework
Performance of the linear models, widely used within the framework of adaptive line enhancement (ALE), deteriorates dramatically in the presence of non-Gaussian noises. On the other hand, adaptive implementation of nonlinear models, e.g. the Volterra filters, suffers from the severe problems of large number of parameters and slow convergence. Nonetheless, kernel methods are emerging solutions t...
متن کامل